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media, said efflux media comprising an ABCA1 acceptor, and measuring the amount of 
substrate that associates with said ABCA1 acceptor in each of said portions of said cells; 
and 

d) comparing the level of efflux of substrate from said first portion of said 
cells to the level of efflux of substrate from said second portion of said cells, wherein a 
change in the level of efflux indicates that said test compound affects the activity of the 
ABCA1 protein. 

0^ The method of claim 1 wherein said ABCA1 acceptor is apo-AI, apo- 

ATV, apo-CI, apo-CII, apo-CHI, apo-E, apo-E2, apo-E3, apo-E4, or a synthetic 
amphipathic peptide representing the alpha-helical domain of an apoprotein. 

^\ sffifT The method of claim 1 wherein the amount of substrate that associates 
with an acceptor for ABCA1 is measured by determining the amount of labeled 
substrate that appears in the medium of the cells. 



Remarks 

Claims 1-17 stand rejected under 35 U.S.C. §112, second paragraph, as allegedly 
being indefinite in the use of the term "ABCA1". Applicants respectfully traverse this 
rejection. In the Luciani reference, use of the term "ABC1" is made to refer to the 
present gene/protein as used in the present invention. Applicants intend exactly the 
same gene/protein when they use the term "ABCA1" in the present application. Simply 
put, sometime after the publication of the Luciani paper, it was decided by those who 
work with this gene family to rename the instant gene from ABC1 to ABCA1, as 
evidenced by Dean et al., Genome Research, 11:11 56-66 (2001). As such, Applicants 
merely refer to the instant gene/protein by its present name, rather than its original 
name, as determined by those of skill in the art. 

Claims 1-4 and 9 stand rejected under 35 U.S.C. §102 as allegedly being 
anticipated by Lawn as evidenced by Francis. Applicants respectfully traverse this 
rejection. In order for substrate efflux to occur at optimal levels in most instances, it is 
necessary for an acceptor molecule to be present. See, for example, page 6, lines 28-30, 
and page 9, lines 13-23, of Applicants' specification. The Apo-A-1 protein used in 
Francis and Lawn is not a "test compound" within the scope of that term as used in the 
claims, but is the acceptor molecule for the cholesterol efflux carried out by ABCA1 . 
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Applicants have amended claim 1 herewith to make it clear that the acceptor molecule is 
a component of the assay separate from the test compound, and requiring the presence of 
both an acceptor molecule and a test compound in the practice of amended invention 
claim 1. As a result of this amendment, claims 9 and 10 have been canceled herewith as 
redundant. 

Claims 1, 5, 6, 9, and 15 stand rejected under 35 U.S.C. §102 as allegedly being 
anticipated by Becq as evidenced by Applicants' specification. Applicants respectfully 
traverse this rejection. This rejection has been rendered moot by incorporating the 
limitations of cancelled claim 10 into claim 1. 

A Notice of Allowance is courteously solicited. 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

In the Claims : 

Claims 1,11, and 13 have been amended as follows: 

I . A method of determining the ability of a test compound to affect the 
activity of ABC A 1 protein, said method comprising the steps of: 

a) introducing labeled substrate into ABCA1 -expressing cells; 

b) adding a test composition comprising said test compound to a first portion 
of said cells, and adding a control composition to a second portion of said cells, wherein 
said control composition is essentially identical to said test composition except that said 
control composition does not include said test compound; 

c) measuring the level of efflux of substrate from each portion of said cells by 

separately combining said first and said second portions of said cells with an efflux media, 
said efflux media comprising an ABCA1 acceptor, and measuring the amount of substrate 
that associates with said ABCA1 acceptor in each of said portions of said cells: and 

ed) comparing the level of efflux of substrate from said first portion of said 
cells to the level of efflux of substrate from said second portion of said cells, wherein a 
change in the level of efflux indicates that said test compound affects the activity of the 
ABCA1 protein. 

I I . The method of claim 1Q I wherein said ABCA1 acceptor is apo-AI, apo- 
AIV, apo-CI, apo-CII, apo-CLQ, apo-E, apo-E2, apo-E3, apo-E4, or a synthetic 
amphipathic peptide representing the alpha-helical domain of an apoprotein. 

13. The method of claim ±Q 1 wherein the amount of substrate that associates 
with an acceptor for ABCA1 is measured by determining the amount of labeled substrate 
that appears in the medium of the cells. 
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The Human ATP-Binding Cassette (ABC) 
Transporter Superfamily 

Michael Dean, 1 - 6 Andrey Rzhetsky, 2 - 5 and Rando Allikmets 3 - 4 

' Human Genetics Section, Laboratory of Genomic Diversity, National Cancer institute-Frederick, Frederick, Maryland 21702, 
Departments of 2 Medical Informatics, 3 Ophthalmology, 4 Pathology, and s Columbia Genome Center, Columbia University, 
New York, New York, 10032 USA 

The ATP-binding cassette (ABC) transporter superfamily contains membrane proteins that translocate a variety 
of substrates across extra- and intra-celluiar membranes. Genetic variation in these genes is the cause of or 
contributor to a wide variety of human disorders with Mendelian and complex inheritance, including cystic 
fibrosis, neurological disease, retinal degeneration, cholesterol and bile transport defects, anemia, and drug 
response. Conservation of the ATP-binding domains of these genes has allowed the identification of new 
members of the superfamily based on nucleotide and protein sequence homology. Phyiogenetic analysis is used 
to divide all 48 known ABC transporters into seven distinct subfamilies of proteins. For each gene, the precise 
map location on human chromosomes, expression data, and localization within the superfamily has been 
determined. These data allow predictions to be made as to potential functions or disease phenotypes associated 
with each protein. In this paper, we review the current state of knowledge on all human ABC genes in inherited 
disease and drug resistance. In addition, the availability of the complete Drosophila genome sequence allows the 
comparison of the known human ABC genes with those in the fly genome. The combined data enable an 
evolutionary analysis of the superfamily. Complete characterization of all ABC from the human genome and 
from model organisms will lead to important insights into the physiology and the molecular basis of many 
human disorders. 



ABC Protein and Gene Organization 

The ABC genes represent the largest family of transmembrane 
proteins. These proteins bind ATP and use the energy to drive 
the transport of various molecules across all cell membranes 
(Higgins 1992; Childs and Ling 1994; Dean and Allikmets 
1995). Proteins are classified as ABC transporters based on the 
sequence and organization of their ATP-binding domains, 
also known as nucleotide-binding folds NBFs. The NBFs con- 
tain characteristic motifs, Walker A and B, separated by -90- 
120 amino acids, found in all ATP-binding proteins (Fig. 1). 
ABC genes also contain an additional element, the signature 
C motif, located just upstream of the Walker B site (Hyde et al. 
1990). The functional protein typically contains two NBFs 
and two transmembrane (TM) domains (Fig. 1). The TM do- 
mains contain 6-11 membrane-spanning a-helices and pro- 
vide the specificity for the substrate. The NBFs are located in 
the cytoplasm and transfer the energy to transport the sub- 
strate across the membrane. ABC pumps are mostly unidirec- 
tional. In bacteria, they are predominantly involved in the 
import of essential compounds that cannot be obtained by 
diffusion (e.g., sugars, vitamins, metal ions, etc.) into the cell. 
In eukaryotes, most ABC genes move compounds from the 
cytoplasm to the outside of the cell or into an extracellular 
compartment (endoplasmic reticulum, mitochondria, peroxi- 
some). Most of the known functions of eukaryotic ABC trans- 
porters involve the shuttling of hydrophobic compounds ei- 
ther within the cell as part of a metabolic process or outside 
the cell for transport to other organs, or secretion from the 
body. 
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The eukaryotic ABC genes are organized either as full 
transporters containing two TMs and two NBFs, or as half 
transporters (Hyde et al. 1990). The latter must form either 
homodimers or heterodimers to constitute a functional trans- 
porter. ABC genes are dispersed widely in eukaryotic genomes 
and are highly conserved between species, indicating that 
most of these genes have existed since the beginning of eu- 
karyotic evolution. The genes can be divided into subfamilies 
based on similarity in gene structure (half vs. full transport- 
ers), order of the domains, and on sequence homology in the 
NBF and TM domains. There are seven mammalian ABC gene 
subfamilies, five of which are found in the Saccharomyces cer- 
evisiae genome. 

Overview of Human ABC Gene Subfamilies 
A list of all known human ABC genes is displayed in Table 1. 
This list includes an analysis of recently released genome se- 
quences (Lander et al. 2001; Venter et al. 2001). Several se- 
quences remain in the genome with homology to ABC genes, 
many of which represent pseudogenes. A comprehensive de- 
scription of all ABC genes goes beyond the scope of this re- 
view. Therefore, only a concise summary of each subfamily is 
provided. 

ABCA (ABCI) 

This subfamily comprises 12 full transporters (Table 1) that 
are further divided into two subgroups based on phyiogenetic 
analysis and intron structure (Broccardo et al. 1999). The first 
group includes seven genes dispersed on six different chro- 
mosomes (ABCA1, ABCA2, ABCA3, ABCA4, ABCA7, ABCA J 2, 
ABCA13), whereas the second group contains five genes 
{ABCAS, ABCA6, ABCA8, ABCA9, and ABCA10) arranged in a 
cluster on chromosome 1 7q24. The ABCA subfamily contains 
some of the largest ABC genes, several of which are >2100 
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Figure 1 Diagram of a typical ABC transporter protein. (A) A dia- 
gram of the structure of a representative ABC protein is shown with a 
lipid bi layer in yellow, the transmembrane domains in blue, and the 
nucleotide binding fold in red. Although the most common arrange- 
ment is a full- transporter with motifs arranged N-TM-NBF-TM-NBF-C, 
as shown, NBF-TM-NBF-TM, TM-NBF, and NBF-TM arrangements are 
also found, (ff) The NBF of an ABC gene contains the Walker A and B 
motifs found in all ATP-binding proteins. In addition, a signature or C 
motif is also present. The most common amino acids found in these 
motifs are shown above the diagram; subfamilies often contain char- 
acteristic residues in these and other regions. 



amino acids. Two members of this subfamily, the ABCA1 and 
ABCA4 (ABCR) proteins, have been studied extensively. The 
ABCA1 protein is involved in disorders of cholesterol trans- 
port and high-density lipoproteins (HDL) biosynthesis (see 
below). The ABCA4 protein transports vitamin A derivatives 
in the outer segments of photoreceptor cells and therefore 
performs a crucial step in the visual cycle. 

ABCB (MDR/TAP) 

The ABCB subfamily is unique in that it contains both hill 
transporters and half transporters. Four full transporters and 
seven half transporters are currently identified as members of 
this subfamily. ABCB1 (MDR/PGY1) is the first human ABC 
transporter cloned and characterized through its ability to 
confer a multidrug resistance phenotype to cancer cells. The 
functional sites of ABCB1 include the blood-brain barrier and 
the liver. The ABCB4 and ABCB 11 proteins are both located in 
the liver and are involved in the secretion of bile acids. The 
ABCB2 and ABCB3 (TAP) genes are half transporters that form 
a heterodimer to transport peptides into the endoplasmic re- 
ticulum, which are presented as antigens by the Class I HLA 
molecules. The closest homolog of the TAPs, the ABCB9 half 
transporter, has been localized to lysosomes. The remaining 
four half-transporters, ABCB6, ABCB7, ABCB8 and ABCB10 
localize to the mitochondria, where they function in iron 
metabolism and transport of Fe/S protein precursors. 

ABCC (CFTR/MRP) 

The ABCC subfamily contains 12 full transporters with a di- 
verse functional spectrum that includes ion transport, cell 
surface receptor, and toxin secretion activities. The CFTR pro- 
tein is a chloride ion channel that has a role in all exocrine 
secretions, and mutations in CFTR cause cystic fibrosis (Quin- 
ton 1999). ABCC8 and ABCC9 proteins bind sulfonylurea and 
regulate potassium channels involved in modulating insulin 
secretion. The rest of the subfamily is composed of nine MRP- 
related genes. Of these ABCC1, ABCC2, and ABCC3 transport 
drug conjugates to glutathione and other organic anions. The 
ABCC4, ABCC5, ABCC11 and ABCC12 proteins are smaller 
than the other MRP 1 -like genes and lack an animo-terminal 
domain (Borst et al. 2000) that is not essential for transport 
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function (Bakos et al. 2000). The ABCC4 and ABCC5 proteins 
confer resistance to nucleosides including PMEA and purine 
analogs. 

ABCDIALD) 

The ABCD subfamily contains four genes, in the human ge- 
nome and two each in the Drosophila and yeast genomes. The 
yeast PXA1 and PXA2 products dimerize to form a functional 
transporter involved in very long chain fatty acid oxidation in 
the peroxisome (Shani and Valle 1998). All of the genes en- 
code half transporters that are located in the peroxisome, 
where they function as homo- and/or heterodimers in the 
regulation of very long chain fatty acid transport. 

ABCE (OABP) and ABCF (GCN20) 

The ABCE and ABCF subfamilies contain genes that have ATP- 
binding domains that are clearly derived from ABC transport- 
ers but have no TM domain and are not known to be involved 
in any membrane transport functions. The ABCE subfamily is 
comprised solely of the oligo-adenylate binding protein, a 
molecule that recognizes oligo-adenylate that is produced in 
response to infection by certain viruses. This gene is found in 
multicellular eukaryotes, but not in yeast, suggesting it is part 
of innate immunity. Each ABCF gene contains a pair of NBFs. 
The best-characterized member, the S. cerevisiae GCN20 gene 
mediates the activation of the eIF-2 a-kinase (Marton et al. 
1997) and a human homolog, ABCF1, is associated with the 
ribosome and appears to have a similar role (Tyzack et al. 
2000). 

ABCG (White) 

The human ABCG subfamily is comprised of six 'reverse' half 
transporters that have an NBF at the amino terminus and a 
TM domain at the carboxyl tenninus. The most intensively 
studied ABCG gene is the white locus of Drosophila. The white 
protein, along with brown and scarlet, transport precursors of 
eye pigments (guanine and tryptophan) in the eye cells of the 
fly (Chen et al. 1996). The mammalian ABCG1 gene is in- 
volved in cholesterol transport regulation (Klucken et al. 
2000). Other ABCG genes include ABCG2, a drug resistance 
gene; ABCGS and ABCG8, transporters of sterols in the intes- 
tine and liver; ABCG3, to date found exclusively in rodents; 
and the ABCG4 gene that is expressed predominantly in the 
liver. The functions of the last two genes are unknown. 

ABC Genes and Human Genetic Disease 

Many ABC genes were originally discovered during the posi- 
tional cloning of human genetic disease genes. To date, 14 
ABC genes have been linked to disorders displaying Mende- 
lian inheritance (Klein et al. 1999) (Table 2). As expected from 
the diverse functional roles of ABC genes, the genetic defi- 
ciencies that they cause also vary widely. Because ABC genes 
typically encode structural proteins, all of the disorders are 
recessive, and are attributable to a severe reduction or lack of 
function of the protein. Heterozygous variants in ABC gene 
mutations, however, are being implicated in the susceptibility 
to specific complex disorders. 

Cystic Fibrosis and CFTR 

Cystic fibrosis is the most common fatal childhood disease in 
Caucasian populations, reaching frequencies ranging from 1: 
900 to 1:2500. This corresponds to a carrier frequency of 1: 
15-1:25. The disease is much less common in African and 
Asian populations, where carrier frequencies of 1:100 to 1:200 
have been estimated. The disease frequency correlates with 
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the frequency of the major allele of the CF gene, a deletion of 
three base pairs (AF508). At least two other populations, how- 
ever, have high frequency CF alleles. The W1282X allele is 
found on 51% of the alleles in the Ashkenazi Jewish popula- 
tion and the 1677delTA allele has been found at a high fre- 
quency in Georgians and is also present at elevated level in 
Turkish and Bulgarian populations. This has led several 
groups to hypothesize that these alleles arose through selec- 
tion of an advantageous phenotype in the heterozygotes. It is 
through CFTR that some bacterial toxins such as cholera and 
Escherichia coli cause increased fluid flow in the intestine and 
result in diarrhea. Therefore, several researchers have pro- 
posed that the CF mutations have been selected for in re- 
sponse to these diseases. This hypothesis is supported by stud- 
ies showing that CF homozygotes fail to secrete chloride ions 
in response to a variety of stimulants, and a study in mice in 
which heterozygous null animals showed reduced intestinal 



fluid secretion in response to cholera toxin (Gabriel et al. 
1993). CFTR is also the receptor for Salmonella typhimitrium 
and implication in the innate immunity to Pseudomonas ae- 
ruginosa (Pier et al. 1998). 

Patients with two severe CFTR alleles like AF508 typically 
display severe diseases with inadequate secretion of pancre- 
atic enzymes leading to nutritional deficiencies, bacterial in- 
fections of the lung, and obstruction of the vas deferens, lead- 
ing to male infertility. Patients with at least one partially func- 
tional allele display enough residual pancreatic function to 
avoid the major nutritional and intestinal deficiencies (Dean 
et al. 1990) and subjects with very mild alleles display only 
congenital absence of the vas deferens with none of the other 
symptoms of CF. Recently, heterozygotes of CF mutations 
have been found to have an increased frequency of pancre- 
atitis (Cohn et al. 1998) and bronchiectasis (PignattJ et al. 
199S). Therefore, there is a spectrum of severity in the phe- 
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notypes caused by this gene that is inversely related to the 
level of CFTR activity. Clearly, other modifying genes and the 
environment also affect disease severity, particularly the pul- 
monary phenotypes. 

Adrenoieukodystrophy 

Adrenoleukodystrophy (ALD) is an X-linked recessive disorder 
characterized by neurodegenerative phenotypes with onset 
typically in late childhood (Mosser et al. 1993). Adrenal defi- 
ciency commonly occurs and the presentation of ALD is 
highly variable. AMN, childhood ALD and adult onset forms 
are recognized, but there is no apparent correlation to ABCD2 
alleles. ALD patients have an accumulation of unbranched, 
saturated fatty acids with a chain length of 24-30 carbons, in 
the cholesterol esters of the brain and in adrenal cortex. The 
ALD protein is located in the peroxisome, where it is believed 
to be involved in the transport of very long chain fatty acids. 

Sulfonylurea Receptor 

The ABCC8 gene is a high-affinity receptor for the drug sul- 
fonylurea. Sulfonylureas are a class of drugs widely used to 
increase insulin secretion in pa- 
tients with non-insulin-dependent 
diabetes. These drugs bind to the 
ABCC8 protein and inhibit an asso- 
ciated potassium channel. Familial 
persistent hyperinsulinemic hypo- 
glycemia of infancy is an autosomal 
recessive disorder in which subjects 
display unregulated insulin secre- 
tion. The disease was mapped to 
Ilpl5-pl4 by linkage analysis, and 
mutations in the ABCC8 gene are 
found in PHHI families (Thomas et 
al. 1995). The ABCC8 gene has also 
been implicated in insulin response 



in Mexican-American subjects (Goksel et al. 1998) and in type 
II diabetes in French Canadians (Reis et al. 2000) but not in a 
Scandinavian cohort (Altshuler et al. 2000). 

Bile Salt Transport Disorders 

Several ABC transporters are specifically expressed in the liver, 
have a role in the secretion of components of the bile, and are 
responsible for several forms of progressive familial intrahe- 
patic cholestasis (PFIC). PFICs are a heterogeneous group of 
autosomal recessive liver disorders, characterized by early on- 
set of cholestasis that leads to liver cirrhosis and failure 
(Alonso et al. 1994). The ABCB4 (PGY3) gene transports phos- 
phatidylcholine across the canalicular membrane of hepato- 
cytes (van Helvoort et al. 1996). Mutations in this gene cause 
PFIC3 (Deleuze et al. 1996; de Vree et al. 1998) and are asso- 
ciated with intrahepatic cholestasis of pregnancy (Dixon et al. 
2000). The rat Abcc2 gene was found to have a frame-shift 
mutation in the strain defective in canalicular multispecific 
organic anion transport, the TR~ rat (Paulusma et al. 1996). 
The TR" rat is an animal model of Dubin-Johnson syndrome 
and mutations in ABCC2 have been identified in Dubin- 
Johnson syndrome patients (Wada et al. 1998). The ABCC2 
protein is expressed on the canalicular side of the hepatocyte 
and mediates organic anion transport. The ABCB1 1 gene was 
originally identified based on homology to ABCB1 (Childs et 
al. 1995). ABCBU is highly expressed on the liver canalicular 
membrane and has been shown to be the major bile salt ex- 
port pump. Mutations in ABCB11 are found in patients with 
PFIC2 (Strautnieks et al. 1998). 

Retinal Degeneration and ABCA4 

The ABCA4 gene is expressed exclusively in photoreceptors 
where it transports retinol (vitamin A) derivatives from the 
photoreceptor outer segment disks into the cytoplasm (Allik- 
mets et al. 1997). The chromophore of a visual pigment rho- 
dopsin, retinal, or conjugates with phospholipids are the 
likely substrates for ABCA4, as they stimulate the ATP hydro- 
lysis of the protein (Sun et al. 1999). Mice lacking Abca4 show 
increased all-trans-retinaldehyde (all-trans-RAL) following 
light exposure, elevated phosphatidylethanolamine (PE) in 
outer segments, accumulation of the protonated Schiff base 
complex of all-trans-RAL and PE (N-retinylidene-PE), and 
striking deposition of a major lipofuscin fluorophore (A2-E) in 
retinal pigment epithelium (RPE) (Weng et al. 1999). These 
data suggest that ABCR is an outwardly directed flippase for 
N-retinylidene-PE. 

Mutations in the ABCA4 gene have been associated with 
multiple eye disorders (Allikmets 2000). A complete loss of 
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ABCA4 function leads to retinitis pigmentosa, whereas pa- 
tients with at least one missense allele have Startgardt disease 
(STGD). STGD is characterized by juvenile to early adult onset 
macular dystrophy with loss of central vision. ABCA4 muta- 
tion carriers are also increased in frequency in age-related 
macular degeneration (AMD) patients. AMD patients display 
a variety of phenotypic features, including the loss of central 
vision after the age of 60, The causes of this complex trait are 
poorly understood, but a combination of genetic and envi- 
ronmental factors have a role. The abnormal accumulation of 
retinoids, caused by ABCA4 deficiency has been postulated to 
be one mechanism by which this process could be initiated. 
Defects in ABCA4 lead to an accumulation of retinal deriva- 
tives in the retinal pigment epithelium behind the retina. 

Mitochondrial Iron Homeostasis 

Several half transporters of the MDR/TAP subfamily have 
been localized to the inner membrane of the mitochondria. 
The yeast ortholog of ABCB7, Atml, has been implicated in 
mitochondrial iron homeostasis, as a transporter in the bio- 
genesis of cytosolic Fe/S proteins (Kispal et al. 1997). Two 
distinct missense mutations in ABCB7 are associated with the 
X-linked sideroblastic anemia and ataxia (XLSA/A) phenotype 
(Allikmets et al. 1999). Three more half transporters from this 
subfamily, ABCB6, ABCB8 and ABCB10 have also been local- 
ized to mitochondria (Table 1). 

Sterol Transport Deficiencies 

Tangier disease is characterized by deficient efflux of lipids 
from peripheral cells, such as macrophages, and a very low 
level of HDL. The disease is caused by alterations in the 
ABCA1 gene, implicating this protein in the pathway of re- 
moval of cholesterol and phospholipids onto HDL (Young 
and Fielding 1999). Patients with hypolipidemia have also 
been described that are heterozygous for ABCA1 mutations, 
suggesting that ABCA1 variations may have a role in regulat- 
ing the level of HDLs in the blood (Marcil et al. 1999). 

Subsequently, the sterol-dependent regulation of ABCA1 
expression was shown (Langmann et al. 1999). Current mod- 
els for ABCA1 function place it at the plasma membrane 
where it mediates the transfer of phospholipid and choles- 
terol onto lipid-poor apolipoproteins to form nascent HDL 
particles. The ABCA1 -mediated efflux of cholesterol is regu- 
lated by nuclear hormone receptors, such as oxysterol recep- 
tors (LXRs) and the bile acid receptor (FXR), as heterodimers 
with retinoid X receptors (RXRs) (Repa et al. 2000). 

Recently, two half-transporter genes, ABCGS and ABCG8 
were characterized (Berge et al. 2000; Lee et al. 2001), located 
head-to-head on the human chromosome 2pl5-pl6, and 
regulated by the same promoter. These genes are both mu- 
tated in families with sitosterolemia, a disorder characterized 
by defective transport of plant and fish sterols and choles- 
terol. Most likely, the two half-transporters form a functional 
heterodimer. The ABCG1 gene is also regulated by cholesterol 
(Klucken et al. 2000) and ABCG4 is highly expressed in the 
liver, suggesting that these two genes may also be involved in 
cholesterol transport (Table 1). 

Multidrug Resistance 

Cells exposed to toxic compounds can develop resistance by a 
number of mechanisms including decreased uptake, increased 
detoxification, alteration of target proteins, or increased ex- 
cretion. Several of these pathways can lead to multidrug re- 
sistance (MDR), in which the cell is resistant to several drugs 



in addition to the initial compound. This is a particular limi- 
tation to cancer chemotherapy and the MDR cell often dis- 
plays other properties, such as genome instability and loss of 
checkpoint control, which complicate further therapy. ABC 
genes have an important role in MDR and at least six genes are 
associated with drug transport (Table 3). 

ABCB1 

The best characterized ABC drug pump is the ABCB1 gene, 
formerly known as MDR1 or PGY1. ABCB1 was the first hu- 
man ABC transporter cloned and characterized through its 
ability to confer a multidrug resistance phenotype to cancer 
cells that had developed resistance to chemotherapy drugs 
(Juliano and Ling 1976). ABCB1 has been demonstrated to be 
a promiscuous transporter of hydrophobic substrates, hydro- 
phobic drugs including drugs including colchicine, VP16, 
adriamycin and vinblastine as well as lipids, steroids, xenobi- 
otics, and peptides (for review, see Ambudkar 1998). The gene 
is thought to have an important role in removing toxic me- 
tabolites from cells, but is also expressed in cells at the blood- 
brain barrier and presumably has a role in transporting com- 
pounds into the brain that cannot be delivered by diffusion. 
ABCB1 also affects the pharmacology of the drugs that are 
substrates and a common polymorphism in the gene affects 
digoxin uptake (Hoffmeyer et al. 2000). 

ABCCI 

The ABCCI gene was identified in the small-cell lung carci- 
noma cell line NCI-H69, a multidrug resistant cell that did not 
overexpress ABCB1 (Cole et al. 1992). The ABCCI pump con- 
fers resistance to doxorubicin, daunorubicin, vincristine, col- 
chicines and several other compounds, a very similar profile 
to that of ABCB1. Unlike ABCB1, however, ABCCI transports 
drugs that are conjugated to glutathione by the glutathione 
reductase pathway (Borst et al. 2000). ABCCI can also trans- 
port leukotrienes, such as leukotriene C4 (LTC4). LTC4 is an 
important signaling molecule for the migration of dendritic 
cells. Migration of dendritic cells from the epidermis to lym- 
phatic vessels is defective in Abccl~ , ~ mice (Robbiani et al. 
2000). 

ABCG2 

Analysis of cell lines resistant to mitoxantrone that do not 
overexpress ABCB1 or ABCCI, led several laboratories to iden- 
tify the ABCG2 {ABCP, MXR1, BCRP) gene as a drug trans- 
porter (Allikmets et al. 1998; Doyle et al. 1998; Miyake et al. 
1999). ABCG2 confers resistance to anthracycline anticancer 
drugs and is amplified or involved in chromosomal translo- 
cations in cell lines selected with topotecan, mitoxantrone, or 
doxorubicin treatment. It is suspected that ABCG2 functions 
as a homodimer because tranfection of the gene into cells 
confers resistance to chemotherapeutic drugs. ABCG2 can 
also transport several dyes such as rhodamine and Hoechst 
33462 and the gene is highly expressed in a subpopulation of 
hematopoetic stem cells (side population) that stain poorly 
for these dyes. The normal function of the gene in these cells, 
however, is unknown. ABCG2 is highly expressed in the tro- 
phoblast cells of the placenta. This suggests that the pump is 
responsible either for transporting compounds into the fetal 
blood supply, or removing toxic metabolites. The gene is also 
expressed in the intestine and ABCG2 inhibitors that could be 
useful in making substrates orally available. 
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Phyiogenetic Analysis of Human ABC Genes 

The identification of the nearly complete set of human ABC 
genes allows a comprehensive phyiogenetic analysis of the 
superfamily. To understand the organization of these genes 
within the subfamilies described previously, an alignment of 
the ATP-binding domains was generated and used for phyio- 
genetic analysis. Figure 2 displays a 
neighbor-joining tree resulting 
from this analysis. The proposed 
nomenclature of ABC transporters 
is in excellent agreement with the 
phyiogenetic trees obtained. In par- 
ticular, all major ABC transporter 
families are represented in the hu- 
man tree by stable clusters with 
high bootstrap values. 

This analysis provides compel- 
ling evidence for frequent domain 
duplication of ATP-binding do- 
mains in ABC transporters. Virtu- 
ally invariably, both ATP-binding 
domains within a gene are more 



closely related to each other than to ATP-binding domains 
from ABC transporter genes of other subfamilies. This could 
be explained by a concerted evolution of domains within the 
same gene, but this seems unlikely because the two domains 
within each gene are substantially diverged. Therefore, it ap- 
pears that duplication of ATP-binding domains within major 
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Figure 2 Phyiogenetic tree of the hu- 
man ABC genes. ATP-binding domain 
proteins were identified using the model 
ABC_tran (accession PF00005) of the 
pfam database (Bateman et al. 1999). 
The hmmsearch program from the 
HMMER package (Eddy 1998) and a set 
of custom-made service scripts were 
used to extract ATP-binding domains 
from all protein sequences of interest. 
Note that some proteins analyzed con- 
tain two ATP-binding domains (denoted 
on the figures as I and II), whereas oth- 
ers contained only one ATP-binding do- 
main. Alignments were generated with 
the hidden Markov model (Eddy 1 995) 
based on the HMMALIGN program using 
the ABC_tran model. The resulting mul- 
tiple alignment was analyzed with 
njboot (N. Takezaki, pers. comm.) 
implementing the neighbor-joining 
tree-making algorithm (Saitou and Nei 
1 987) and the number at the branch of 
the nodes represents the value from 1 00 
replications. The distance measure be- 
tween sequences used for tree making 
was the Poisson correction for multiple 
hits (Zuckerkandl and Pauling 1 965). To 
verify the position of the previously un- 
known subgroup of Drosophila genes 
(CG6162, CG6162, and CGI J 747), they 
were aligned with a representative of 
each of the human subfamilies. Because 
some of the human proteins had two 
ATP-binding domains, the set contained 
three Drosophila and 12 human se- 
quences. The ITT (Jones et at. 1992) 
model as defined in the MOLPHY pack- 
age with the 'star decomposition' op- 
tion was employed. The tentative best 
tree (the total number of possible trees 
for 1 5 sequences is too large for exhaus- 
tive search through all these trees) was 
then used for local maximum likelihood 
search through the surrounding tree to- 
pologies. 
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ABC families was attributable to several independent duplica- 
tion events rather than a single ancestral duplication. 

Drosophila ABC genes 

To begin to understand the organization and evolution of the 
Drosophila ABC genes, the Celera (Myers et al. 2000) and Fly- 
Base databases were searched for sequences by a combination 
of BLAST searches and analysis of the annotation already pres- 
ent in the databases. Initial subfamily classifications were as- 
signed based on homology and blast scores, and the loca- 



tion of each gene was recorded (Table 4). In total, 56 genes 
were identified and there is at least one representative of each 
of the known mammalian subfamilies (Table 5). To confirm 
the subfamily groupings, the ATP-binding domain amino acid 
sequences were used to perform phylogenetic analyses. A rep- 
resentative tree is shown in Figure 3. As expected, genes from 
the same subfamily cluster together and confirm the initial 
assignments made by inspection. 

As in the human and yeast genomes, the Drosophila ABC 
genes are largely dispersed in the genome. There are four clus- 



" Table 4. ibrosophUa A^C'Gm^^ '^:;r'.^' , v . ■ f Vl ' ^ '.-^\ .. '•- 

^iaV ^Prote^ location (chr: nuc) . Gytd/loc 



■Gene ; 

CG3 rsS \ ■ > o I 

CG2759<S W: V v- . 

vGG?2%^\,A>.; 

CG I 801:, 

CGI 494 



Mutants 



> AAF45509 ^ !'/* AE0034T7 
AAF4582© ; AE003425 
AAM8u1i9^ AEU03486; 

fKH^BSS^: T^^Alo6 s 3489'• V ' 

AE0035T3 - AE0035T3 . 

?AAFS^7 A£063568 
^FS0|3o r :v 'AE663568 
*aAf"508*38 :': -AE0O35.38 ^ 




' CG2969 , 
iCGlhUZ 
CG7806 , 
CG7627 
CGS8Si; 



; AAF52284" 
^XAF52639?* 
^ AAF52648:v 

• - 0/r-; :^;AAF52835v"/ 

CG6214 ? 1*J:..<- • ^J}: AAF53223 ) 

^CGiO^-^>&;^'^ 



AE653611 
AE003620 

>:;AE6G362a 
c AE003626 
i<* AE003627 
AE663637 
AEO0364T. 
AE663661 

AE003668 ; 



;aAF53328 

^■lAAFS3^3v^^ 
, -CG927^^:^'-> >':'■'■ ^^F53950' ^ AEO03668> 
;-*eG87§9^^ •:- 
'-■GG357$S'^fw^^ -•iAA^58?37i\^ VAEO0382O' ' - 

';GGM2fc;>^S:^; r ''.. : . \'A^56fc76^%^-AE063563-"-' : 

: AAF50342 ^ 

: AE0b3544 : 




: -rGT8633>:.?^- 

';-GGHb69^%K^, 
^■GGd:?:^"-^-^^ 

.^GGff«98^^^, 

V^G237fe^^.g;^r 



. .^Ml6a : ^S^>^E^37^y 



MF56384'- 1 > : i/-AE0«53756 
'■ '^A^o^O^^^^AEujOS^e' 
MF56870 v ^ AE003768 



. 609 B 

V- : 696 ; e 
•v/9bl;E : 

761 B 
y ; "61$iE-: ;: 
, : -2556>A ? f 
::618D ' 
" 1500 A 1 
- 17|3k .-• 
151 1 A 

; ;11977C ! 

-^3:G-;-.; 
V>627C*r 
X ^5igf 

•>;58T£<g ; ^ 

■:; : . ;: -*729iQ->: 

.' 832G 
: 705 H 
1487 G 

' : ';::i-327?g^:; 

^v-;i : »9^C^ : 
:: 324'A^.' 
;r.T275 v B'^ 

;V'"130^|^ : 

; v •.i344jC:^ 
1279 B - : 

:y r i3i.3^v,. 

■ -;il382Afe- ; 

1^283!^- 

"0*606$:? 
1 326 B : 

■ >^13d2^;^ 

•?; : .-.::597JC^r 
666 G 
•1463 A ^ 

^-1:666:^:^ 

1307G 

^^8oiiS;BA : ^ 

C>C^7fl*' " 

-i";7d2'<i'>.' 
'i : ^'<502C^ 

>^535:.H^. 

i :808iH • 

r^d346G^: 
1^:^:73aC):n 



• X 252038i254671 ; ; 
X ^5753sL53988^^ : 
X y 39381 3-1i139673T r ; 
X 12363^42-12360802 . : 
X T5454374?! 5450765 . 
X 1 551 3659-r5523896 ■ 
X 1949461 5^19^974^5 

< "X 2:0757^20763^^ 1 

. X 20924^95-^91^580^ 
; X 2089lS265T209uT^78r 

^2L-|2'3@02l-ll7541;; 
^b:i^^2u51^1^06SH'' 

2L 2649300^2658^961 1^ 
v 2L^3'2li^W32C@S24: ; ;. 
t 2L "32l4l56^322Cp0^ f 
V2L 3257267-325948^ 
2t 4251813-4262480/ 
2L 5656028-5653232 ; 
2L 821 2839-821 8079 ~ 
21^231^8256791 ; ; 

; ; 2l^1 0105357^1 00892^2^ 
2L 1 261 91 74-1^2641 593/ 



:1B4--^.-^-"x:". 
10G10 

MsEi^'^^-:;;^ 

M3E18-Fl^iv^ 

;:r8Fi;iF2^;r;;- 

19F1 +jk 

19F2 

19F2 

^•i;9F2^v.^t 

■'■,'2i'B^i-v c /' 
'wb/^yK; .: 

; ;22B3v"i ; >^ : ^ 

:23E4^23E5 
•y23e4^23E5« 
^23F'^ 

24F8 

26A1 - ? 

29A3-A4 

( ;3bE1-30E3 

r '33F2#"' " 



6;^2L;i 3^7-55^ 

^•V- .1 ■2L^ ; i'883S1^i ; 883^9/i ; :>3 

':^2i^2d74lS2l26738317c '■: .3?^:%?^; 
.;.--'^v2R^2o5o ; 0«31^; 



-2R^794<^6^^33l©^^ 
^2R?9^3l5%p9p'll2*2^- :^ 
-2R^52b^94lS20872i5^ 



45DC 
49E1v^ 



'■H;.'. '*' 



y^T622^8056^22li59^ 

; 2R 1 ^7SS6MS4658l$e?^CEt^^ ^ 



HI 




3L 159*7621 ?16021 55 
3L 61 80561 i61 75400 

-3U6l1^6^i^lil^6|^ 
31 88^1^9^8892720^ 
;3L1 1555624^1 1559309; 



; ^3R 73626^5^73^1 411 <v^86Flt 
:^3RTOi58b3|fl^i^42b t 

3R 1 572^%1 5728807" ; 




;3R 24409613-24429503 98F1 : 

t: 3R;:2488^241^248925^ 

^v4>1 5426^1|5146> 101 F^!->^%; 



1162 Genome Research 

www.genome.org 

BEST AVAILABLE COPY 



The Human ABC Transporter Superfamily 




ters of two genes and one cluster of four genes (Table 4). One 
of these clusters (on chromosome 2L, band 37B9) is composed 
of an ABCB and an ABCC gene indicating that this is a chance 
grouping of genes. The remaining clusters are composed of 
genes from the same subfamily and arranged in a head-to-tail 
fashion consistent with gene duplication. Because the clusters 
are themselves dispersed and involve different subfamilies, 
they presumably represent independent gene duplication 
events. 

The best-studied Drosophila ABC genes are the eye pig- 
ment precursor transporters white, scarlet, and brown (w, st, 
and br, respectively). These genes are part of the ABCG sub- 
family and have a unique NBF-TM organization. Surprisingly, 
there are 15 ABCG genes in the fly genome, making this the 
most abundant ABC subfamily. This is in sharp contrast to the 
only five and six known ABCG genes in the human and 
mouse genomes, respectively. The Drosophila ABCG genes are 
highly dispersed in the genome with only two pairs of linked 
genes. In addition, they are quite divergent phylogenetically, 
suggesting that there were many independent and ancient 
gene duplication events. 

Several Drosophila ABCB genes, Mdr49, MdrSO, and 
Mdr6S, have also been well characterized. A fourth member of 
this group, CGI 0226, was identified that is clustered with 
Mdr6S (Table 4). These genes are closely related to the human 
and mouse P-glycoproteins (ABCB1, ABCB4) and disruption 
of Mdr49 results in sensitivity to colchicines (Wu et al. 1991). 

To search for potential gene functions, each region con- 
taining a Drosophila ABC gene was searched for phenotypes 
that have been not assigned to a gene (Table 4). The most 
promising connection is the identification of several eye phe- 
notypes (vin, rose, and cln) in the region of the CG7346 gene. 
Because CG7346 is part of the ABCG family and is therefore 
related to w, st, and br, it is tempting to speculate that muta- 
tions in CG7346 cause one or more of these phenotypes. Be- 
cause ABC genes perform very diverse functions and are asso- 
ciated with varied phenotypes it is hard to gather much ad- 
ditional insight from this analysis. 

Three genes, CG9990, CG6162, and CGI J 147, were iden- 
tified that do not fit into any of the known subfamilies and, 
in fact, are most closely related to ABC genes from bacteria. 
There are no close homologs to these genes in any other eu- 
karyotic genome including worm and plants. These genes are 
within large contiguous sequences and have introns, there^ 
fore they do not represent contamination from bacterial se- 
quences. This group forms a distinct cluster on the Drosophila 
tree. To elucidate the position of this group with respect to 



human ABC transporter families, we produced two datasets 
with nine and 15 sequences, respectively, containing ATP- 
binding domains from the three Drosophila genes and from 
one representative from each of the human families. In the 
smaller subset, only one ATP-binding domain from each hu- 
man gene was used. These two datasets were subjected to 
maximum likelihood analysis, with a heuristic algorithm for 
the larger dataset and with a rigorous exhaustive search for 
the smaller datasets. The result of the analysis clearly indicates 
that the new subfamily of ABC transporter in Drosophila is 
significantly different from all known families of ABC trans- 
porters (data not shown) and might have a yet unidentified 
functional role. We propose designating this new group of 
genes as subfamily H. DNA sequence searches showed that 
although significant homology is present between these three 
genes in both the NBF and TM regions, there is no homology 
in the TM regions with any other eukaryotic or prokaryotic 
ABC proteins. The closest related NBF sequences are all bac- 
terial, such as the E. coli YHIH gene, a ribosomal ATPase. This 
gene shows 36% amino acid identity in the NBF domain. 

The ATP-binding domains in a generic ABC transporter 
appear to be following a birth-and-death process that has 
been described before for other multigene families. In a typi- 
cal birth-and-death process (Ota and Nei 1994; Nei et al. 
1997), repeated genetic segments (in this case domains) ex- 
perience nearly random fluctuation in their number. That is, 
genomic deletions decrease segment number, while genome 
duplications increase it. The phylogenetic trees that we pre- 
sent here for human and Drosophila give clear evidence of the 
birth-and-death evolution of ATP-binding domains in ABC 
transporters, and similar analyses of the yeast genes support 
this. In 5. cerevisiae, there are several G subfamily ABC genes 
(e.g., YOROllw and SNQ2) that have two NBF and TM do- 
mains (Michaelis and Berkower 1995; Decottignies and Gof- 
feau 1997). In contrast, all of the Drosophila and human ABCG 
family genes are half transporters. Therefore, it appears that 
the second ATP-binding domain in animal ABCG genes was 
lost in an ancestral lineage preceding the animal radiation but 
not the animal-fungi split. Similarly, ABCE genes in humans 
are one-domain genes, whereas both Drosophila and yeast 
have two-domain homologs (RLI1 in yeast and CG5651 in 
Drosophila). Therefore, the loss of the second domain appears 
to be a relatively recent event in human ABCE genes, probably 
in the vertebrate lineage. 

Perspectives 

One of the most fascinating findings from the analysis of the 
ABC genes from the human, worm, and fly genomes is the 
remarkable similarity in the number of ABC genes. Although 
additional sequencing and annotation will undoubtedly 
change the exact number somewhat, it is clear that humans 
do not have substantially more ABC genes than much simpler 
eukaryotes. This suggests that there is a core of essential ABC 
genes that are required for all multicellular eukaryotes. These 
higher eukaryotes do have about twice the number of ABC 
genes as does S. cerevisiae, suggesting that the evolution of 
multicellularity was accompanied by the expansion in the 
number of ABC genes. It is clear, however, that certain species 
have expanded some subfamilies more than others. As men- 
tioned above, flies have considerably more ABCG genes than 
humans (15 vs. five), whereas there are at least 12 ABCA genes 
in the human and only 10 in Drosophila. Despite the remark- 
able similarity in gene number, most Drosophila ABC genes do 
not have a clear ortholog in the human genome. The likely 
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Figure 5 Phylogenetic tree of the Drosophila ABC genes. Analysis (see Fig. 2) was performed with all extracted Drosophiia sequences and a 
representative of each human subfamily. 
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exceptions are the ABCD, ABCE, and ABCF subfamilies that 
contain nearly identical gene numbers in the two species. The 
only ABC genes in the human and yeast genomes that are 
documented to have similar function are the ABCD (ALD-like) 
genes that are involved in the transport of very long chain 
fatty acids, and the ATM1/ABCB7 gene that is present in the 
mitochondria and has a role in iron metabolism. 

A major limitation to the understanding of ABC genes is 
the difficulty in obtaining crystal structures. The three- 
dimentional structure of two bacterial NBF proteins has been 
obtained and has greatly improved our understanding of the 
organization and function of this portion of the protein 
(Hung et al. 1998). It is clear that conformational changes in 
the NBF influence substrate binding and transport, and to 
date, this can only be addressed by laborious mutagenesis and 
biochemical experiments (Hafkemeyer et al. 1998). 

Ideally, the worm and fly ABC genes could be used to 
elucidate the function of their human counterparts. To date, 
however, we know considerably more about human ABC gene 
function that we do about either the Drosophila or Caenorhab- 
ditis elegans ABC genes. In fact, the only Drosophila genes ini- 
tially identified based on their function are the eye pigment 
genes. Similarly, only one ABC gene in C. elegans {ced-7) was 
identified based on function (involvement in cell death; Wu 
and Horvitz 1998). It is not obvious why more ABC genes are 
not associated with visible or detectable phenotypes in Dro- 
sophila or C. elegans as many of the human ABC genes affect 
morphology, behavior, lifespan, and fertility and these are 
traits that have been observed and selected for repeatedly in 
these organisms. These model organisms, however, do pro- 
vide an important resource for the future systematic study of 
ABC genes. Analysis of ABC gene expression combined with 
gene disruptions should yield important clues to gene func- 
tion. In addition, the capability of designing suppressor 
screens may allow the identification of pathways that involve 
ABC genes. 
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